Efficient and Effective Visual Codebook Generation Using Additive Kernels

نویسندگان

  • Jianxin Wu
  • Wei-Chian Tan
  • James M. Rehg
چکیده

Common visual codebook generation methods used in a bag of visual words model, for example, k-means or Gaussian Mixture Model, use the Euclidean distance to cluster features into visual code words. However, most popular visual descriptors are histograms of image measurements. It has been shown that with histogram features, the Histogram Intersection Kernel (HIK) is more effective than the Euclidean distance in supervised learning tasks. In this paper, we demonstrate that HIK can be used in an unsupervised manner to significantly improve the generation of visual codebooks. We propose a histogram kernel k-means algorithm which is easy to implement and runs almost as fast as the standard k-means. The HIK codebooks have consistently higher recognition accuracy over k-means codebooks by 2–4% in several benchmark object and scene recognition data sets. The algorithm is also generalized to arbitrary additive kernels. Its speed is thousands of times faster than a naive implementation of the kernel k-means algorithm. In addition, we propose a one-class SVM formulation to create more effective visual code words. Finally, we show that the standard kmedian clustering method can be used for visual codebook generation and can act as a compromise between the HIK / additive kernel and the k-means approaches.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Image Compression with Efficient Code Book Initialization Using Lbg Algorithm Image Compression with Efficient Codebook Initialization Using Lbg Algorithm

Vector quantization (VQ) has received a great attention in the field of multimedia data compression since last few decades because it has simple decoding structure and can provide high compression ratio. In general, algorithms of VQ codebook generation focus on solving two kinds of problem: (i) to determine the quantization regions and the code words that minimize the distortion error. (ii) to ...

متن کامل

An Efficient and Effective Method for VQ Codebook Design

Although there have been many methods proposed to speed up the VQ codebook generation process, it turns out to be the case that these methods either suffer from slight degradation in overall distortion or just maintain the same quality with hardly time saving. In other words, the methods proposed to generate better codebooks are often inefficient, and those which are fast tend to have worse qua...

متن کامل

Image Compression with Efficient Codebook Initilization Using Lbg- Optimization Algorithm

In this paper we present a very simple and yet effective algorithm to generate good codebook. In general VQ codebook generation algorithm focus on solving two problem(i)To reduce the computational complexity of code words search the building the codebook. (ii) Extra Computational overhead to calculate the measurement of codeword in codebooks..in this paper, a novel VQ codebook generation method...

متن کامل

Bilevel Visual Words Coding for Image Classification

Bag-of-Words approach has played an important role in recent works for image classification. In consideration of efficiency, most methods use kmeans clustering to generate the codebook. The obtained codebooks often lose the cluster size and shape information with distortion errors and low discriminative power. Though some efforts have been made to optimize codebook in sparse coding, they usuall...

متن کامل

Metric Learning in Codebook Generation of Bag-of-Words for Person Re-identification

Person re-identification is generally divided into two part: first how to represent a pedestrian by discriminative visual descriptors and second how to compare them by suitable distance metrics. Conventional methods isolate these two parts, the first part usually unsupervised and the second part supervised. The Bag-of-Words (BoW) model is a widely used image representing descriptor in part one....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Journal of Machine Learning Research

دوره 12  شماره 

صفحات  -

تاریخ انتشار 2011